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CAGCGTCAGACGCAGGGCACTGAGAATGTGCGACAGCGCGCAACGATGAAGTAGCCCAGAGGGTCCCTTG 
GAAAATGAGGCCAGGGTCCCTGCTGCTGCTTGTTCTGCTGCTCGCCCTGTCCAGGAGCCTGCGGGGCAAA 
GAGTGTGCGTCTCCACCCTGTGAGTGTCACCAGGAGGACGACTTCAGAGTCACCTGCAAGGAGCTCCACC 
GAATCCCCAGCCTGCCGCCCAGCACCCAGACTCTGAAGCTCATCGAGACTCATCTGAAGACCATACCCAG 
TCTTGCATTTTCGAGTCTGCCCAATATTTCCAGGATCTATTTATCTATAGATGCAACTCTGCAGCGGCTG 
GAACCACATTCTTTCTACAATTTGAGTAAAATGACTCACATAGAAATCCGGAACACCAGAAGCTTAACCT 
ATATAGACCCTGATGCCTTGACAGAGCTCCCCTTGCTCAAGTTTCTTGGCATTTTCAATACTGGACTTAG 
AATATTCCCTGACTTGACCAAAATTTATTCCACGGACATATTCTTTATACTTGAAATCACAGACAACCCT 
TACATGAGTTCGGTCCCTGAAAACGCATTCCAGGGCCTATGCAATGAAACCTTGACCCTGAAACTGTACA 
ACAATGGATTTACTTCAGTCCAAGGACATGCTTTCAATGGAACAAAGCTGGATGCTGTTTACCTAAACAA 
GAATAAATACCTGACAGCTATAGACAACGATGCCTTTGGAGGAGTATACAGTGGACCAACTTTGCTAGAT 
GTGTCTTCCACCAGCGTCACTGCCCTTCCTTCCAAAGGCCTGGAGCACCTCAAAGAACTGATCGCAAAAG 
ACACCTGGACTCTCAAAAAGCTCCCGCTGTCGTTGAGTTTCCTCCACCTCACTCGGGCTGACCTCTCTTA 
H» CCCGAGCCACTGCTGCGCTTTTAAGAACCAGAAGAAAATCAGGGGAATCCTGGAGTCTTTGATGTGTAAT 
O GAGAGCAGTATCCGGAACCTTCGTCAAAGGAAATCAGTGAACATCTTGAGGGGTCCCATCTACCAGGAAT 
Q ATGAAGAAGATCCGGGTGACAACAGTGTTGGGTACAAACAAAACTCCAAGTTCCAGGAGAGCCCAAGCAA 
J; CTCTCACTATTACGTCTTCTTTGAAGAACAAGAGGATGAGGTCGTTGGTTTCGGCCAAGAGCTCAAAAAT 
||| CCTCAGGAAGAGACTCTCCAAGCCTTCGAGAGCCACTATGACTACACGGTGTGTGGGGACAACGAGGACA 
tfl TGGTGTGTACCCCCAAGTCGGACGAGTTTAACCCCTGTGAAGATATCATGGGCTACAGGTTCCTGAGAAT 
«I CGTGGTGTGGTTTGTCAGTCTGCTGGCTCTCCTGGGCAATATCTTCGTCCTGCTCATTCTGCTAACCAGC 

MM* 

CACTACAAATTGACCGTGCCGCGGTTCCTCATGTGCAACTTGGCCTTTGCAGATTTCTGCATGGGGGTAT 
+< ACCTGCTTCTCATTGCCTCTGTAGACCTGTACACACACTCTGAGTACTACAACCACGCCATCGACTGGCA 
» GACGGGCCCTGGGTGCAACACGGCTGGCTTCTTCACTGTTTTCGCCAGTGAGTTATCAGTGTACACACTG 
N* ACGGTCATCACCCTGGAGCGATGGTACGCCATCACCTTCGCCATGCGCCTGGATAGGAAGATCCGCCTCA 
p GGCACGCGTACACCATCATGGCTGGGGGCTGGGTTTCCTGCTTCCTTCTCGCCCTGCTCCCGATGGTGGG 
j|J AATCAGCAGCTATGCCAAGGTCAGCATCTGCCTGCCAATGGACACCGACACCCCTCTTGCACTCGCATAC 
S ATTGTCCTCGTTCTGCTGCTCAATGTTGTTGCCTTTGTTGTCGTCTGTTCCTGCTATGTGAAGATCTACA 
TCACGGTC CGAAATCC C CAGTACAACC CTCGAGATAAAGACAC CAAGATTGC CAAGAGGATGGCTGTGTT 
[T GATCTTCACTGACTTCATGTGCATGGCGCCCATCTCCTTCTATGCGCTGTCGGCACTTATGAACAAGCCT 
CTAATCACTGTTACTAACTCCAAAATCTTGTTGGTTCTCTTCTACCCCCTCAACTCCTGTGCCAATCCGT 
TTCTCTATGCTATTTTCACCAAGGCCTTCCAGAGGGACGTGTTCATCCTGCTCAGCAAGTTTGGCATCTG 
CAAACGCCAGGCCCAGGCCTATCAGGGTCAGAGAGTCTGTCCCAACAATAGCACTGGTATTCAGATCCAA 
AAGATTCCCCAGGACACGAGGCAGAGTCTCCCCAACATGCAAGATACCTATGAACTGCTTGGAAACTCCC 
AGCTAGCTCCAAAACTGCAGGGACAAATCTGAGAAGAGTATAAGCAAACAGCCTTGTAAAGGAAAGGCTA 
CGCTAGTCACAGTGAGACTTACAAAAGGCTGGTTTCTTGAACATGCGTTCCAGTCCCGTGACATGTGAAC 
ACATAGGTTCATGCAGGTGATGATTCATAGGGTCAGAGTTCATCTCTAGAAAGTATTGCCTC (SEQ ID NO:l) 
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MRPGSLLLLVLLLALSRSLRGKECASPPCECHQEDDFRVTCKELHRIPSLPPSTQTLKLIETHLKTIPSLAFSSLPM 
ISRIYLSIDATLQRLEPHSFYNLSKMTHIEIRNTRSLTYIDPDALTELPLLKPLGIFNTGLRIFPDLTKIYSTDIFF 
ILEITDNPYMTSVPENAFQGLCNETLTLKLYNNGFTSVQGHAFNGTKLDAVYLNKtTKYLTAIDNDAFGGVYSGPTLL 
DVSSTSVTALPSKGLEHLKELIAKDTWTLKKLPLSLSFLHLTRADLSYPSHCCAFKNQKKIRGILESLMCNESSIRN 
LRQRKSVNILRGPIYQEYEEDPGDNSVGYKQNSKFQESPSNSHYYVFFEEQEDEWGFGQELKNPQEETLQAFESHY 
DYTVCGDNEDMVCTPKSDEFNPCEDIMGYRFLRIVWFVSLLALLGNIFVLLILLTSHYKLTVPRFLMCNLAFADFC 
MGVYLLLIASVDLYTHSEYYNHAIDWQTGPGCNTAGFFTVFASELSVYTLTVITLERWYAITFAMRLDRKIRLRHAY 
TIMAGGWSCFLLALLPMVGISSYAKVSICLPMDTDTPLALAYIV^ 

DKDTKI AKRMAVL I FTDFMCMAP I S FYALS ALMNKPL I TVTNS KI LLVLF YPLNS CANPFLYAI FTKAFQRDVF ILL 
SKFGICKRQAQAYQGQRVCPNNSTGIQIQKIPQDTRQSLPNMQDTYELLGNSQLAPKLQGQISEEYKQTAL (SEQ 
ID NO: 2) 
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underlined = deleted in targeting construct 

t ] = sequence flanking Neo insert in targeting construct 



[CAGCGTCAGACGCAGGGCACTGAGAATGTGCGACAGCGCGCAACGATGAAGTAGCCCAG 

AGGGTCCCTTGGAAAATGAGGCCAGGGTCCC] TGCTGCTGCTTGTTCTGCTGCTCGCCCT 

GTCCAGGAGCCTGCGGGGCAAAGAGTGTGCGTCTCCAGCCTGTGA [GTGTCACCAGGAGG 
ACGACTTCAGAGTCACCTGCAAGGAGCTCCACCGAATCCCCAGCCTGCCGCCCAGCACCC 

AGACTCT ] GAAGCTCATCGAGACTCATCTGAAGACCATACCCAGTCTTGCATTTTCGAGT 
CTGCCCAATATTTCCAGGATCTATTTATCTATAGATGCAACTCTGCAGCGGCTGGAACCA 
CATTCTTTCTACAATTTGAGTAAAATGAC TCACATAGAAATCCGGAACAC CAGAAGC TTA 
ACCTATATAGACCCTGATGCCTTGACAGAGCTCCCCTTGCTCAAGTTTCTTGGCATTTTC 
AATACTGGACTTAGAATATTCCCTGACTTGACCAAAATTTATTCCACGGACATATTCTTT 
ATACTTGAAATCACAGACAAGCCTTACATGACTTCGGTCCCTGAAAACGCATTCCAGGGC 
CTATGCAATGAAACCTTGACCCTGAAACTGTACAACAATGGATTTACTTCAGTCCAAGGA 
CATGCTTTCAATGGAACAAAGCTGGATGCTGTTTACCTAAACAAGAATAAATACCTGACA 
GCTATAGACAACGATGCCTTTGGAGGAGTATACAGTGGACCAACTTTGCTAGATGTGTCT 
TCCACCAGCGTCACTGCCCTTCCTTCCAAAGGCCTGGAGCACCTCAAAGAACTGATCGCA 
AAAGACACCTGGACTCTCAAAAAGCTCCCGCTGTCGTTGAGTTTCCTCCACCTCACTCGG 
GCTGACCTCTCTTACCCGAGCCACTGCTGCGCTTTTAAGAACCAGAAGAAAATCAGGGGA 
ATCCTGGAGTCTTTGATGTGTAATGAGAGCAGTATCCGGAACCTTCGTCAAAGGAAATCA 
GTGAACATCTTGAGGGGTCCCATCTACCAGGAATATGAAGAAGATCCGGGTGACAACAGT 
GTTGGGTACAAACAAAACTCCAAGTTCCAGGAGAGCCCAAGCAACTCTCACTATTACGTC 
TTCTTTGAAGAACAAGAGGATGAGGTCGTTGGTTTCGGCCAAGAGCTCAAAAATCCTCAG 
GAAGAGACTCTCCAAGCCTTCGAGAGCCACTATGACTACACGGTGTGTGGGGACAACGAG 
GACATGGTGTGTACCCCCAAGTCGGACGAGTTTAACCCCTGTGAAGATATCATGGGCTAC 
AGGTTCCTGAGAATCGTGGTGTGGTTTGTCAGTCTGCTGGCTCTCCTGGGCAATATCTTC 
GTCCTGCTCATTCTGCTAACCAGCCACTACAAATTGACCGTGCCGCGGTTCCTCATGTGC 
AACTTGGCCTTTGCAGATTTCTGCATGGGGGTATACCTGCTTCTCATTGCCTCTGTAGAC 
CTGTACACACACTCTGAGTACTACAACCACGCCATCGACTGGCAGACGGGCCCTGGGTGC 
AACACGGCTGGCTTCTTCACTGTTTTCGCCAGTGAGTTATCAGTGTACACACTGACGGTC 
ATCACCCTGGAGCGATGGTACGCCATCACCTTCGCCATGCGCCTGGATAGGAAGATCCGC 
CTCAGGCACGCGTACACCATCATGGCTGGGGGCTGGGTTTCCTGCTTCCTTCTCGCCCTG 
CTCCCGATGGTGGGAATCAGCAGCTATGCCAAGGTCAGCATCTGCCTGCCAATGGACACC 
GACACCCCTCTTGCACTCGCATACATTGTCCTCGTTCTGCTGCTCAATGTTGTTGCCTTT 
GTTGTCGTCTGTTCCTGCTATGTGAAGATCTACATCACGGTCCGAAATCCCCAGTACAAC 
CCTCGAGATAAAGACACCAAGATTGCCAAGAGGATGGCTGTGTTGATCTTCACTGACTTC 
ATGTGCATGGCGCCCATCTCCTTCTATGCGCTGTCGGCACTTATGAACAAGCCTCTAATC 
ACTGTTACTAACTCCAAAATCTTGTTGGTTCTCTTCTACCCCCTCAACTCCTGTGCCAAT 
CCGTTTCTCTATGCTATTTTCACCAAGGCCTTCCAGAGGGACGTGTTCATCCTGCTCAGC 
AAGTTTGGCATCTGCAAACGCCAGGCCCAGGCCTATCAGGGTCAGAGAGTCTGTCCCAAC 
AATAGCACTGGTATTCAGATCCAAAAGATTCCCCAGGACACGAGGCAGAGTCTCCCCAAC 
ATGCAAGATACCTATGAACTGCTTGGAAACTCCCAGCTAGCTCCAAAACTGCAGGGACAA 
ATCTC AGAAGAGT ATAAGC AAAC AGC C TTGTAAAGGAAAGGCTACGCTAGTCACAGTGAG 
ACTTACAAAAGGCTGGTTTCTTGAACATGCGTTCCAGTCCCGTGACATGTGAACACATAG 
GTT C ATGCAGGTGATGATTC ATAGGGT CAGAGTTCATCTCTAGAAAGTATTGCCTC 
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Gene Sequence Structure 

* 



91 bp 



Sequence Deleted 



163 bp 



Size of full-length 
cDNA: 2512 bp 




Targeting Vector* (genomic sequence) 



%rm Length: 
5': 5 kb 
V: 1.3 kb 



5^ 



Endogenous Locus 



* Not drawn to scale 



LacZ-Neo 



*. »{ "-.-St ^ 



- C ' .Am >< s **** c 




5 ' >ACTTGAGAGCCTCTCCTTCCC 
CCTCTCCAGCGTGCTCTCCAGCGA 
TGAGGTCACAGCCCCTCGGAGCCC 
TCCTCCTCCCTCCCTTCCCCTCCT 
GCACCCGGGTCTCTTCCAGCGTCA 
GACGCAGGGCACTGAGAATGTGGC 
GACAGCGCGCAACGATGAAGTAGC 
CCAGAGGGTCCCTTGGAAAATGAG 
GCCAGGGTCCCO 1 
(SEQ ID NO: 3) 



5 1 >GTGTCACCAGGAGGACGACTT 
CAGAGTCACCTGCAAGGAGCTCCA 
CCGAATCCCCAGCCTGCCGCCCAG 
CACCCAGACTCTGTGAGTAGCCAA 
GGCCAAGACCCCCCCCCCCCGAGA 
AATTCGTGGTGTGTGTTGGGGTGT 
GCGCGGATATCTGGTCAGTCCCTG 
TACAAATTCAATCCCCCATGCTCG 
GGAAGGTCAGC < 3 1 
(SEQ ID NO: 4) 
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